AITopics | Harrison County

Unmanned Surface Vehicle Path Planning from the Perspective of Multi-Modality Constraints: A Comprehensive Analysis

Zhou, Chunhui, Gu, Shangding, Wen, Yuanqiao, Du, Zhe, Xiao, Changshi, Huang, Liang, Zhu, Man

arXiv.org Artificial Intelligence, Mar-8-2025

With the development and application of artificial intelligence and machine learning, more and more studies focus on unmanned vehicles and their applications (Zhou, Z., 2016). For example, Unmanned Ground Vehicles (UGVs), or wheeled robots, are widely used in industrial automation (automatic forklifts), warehouse management, planetary exploration (lunar rovers), disaster rescue, intelligent transportation (autonomous driving) and military operations (de-mining robots) (Arai et al., 2002; Farinelli et al., 2004; Kui et al., 2007). Applications of Unmanned Aerial Vehicles (UAVs) are also increasingly shifting from the military domain to civil use, such as remote sensing photography, agricultural spraying, communications relay, environmental monitoring and express delivery (Jayoung et al., 2013; George et al., 2012; Mingzhu et al., 2016). The development of UGVs and UAVs has already reached a new level. Another unmanned vehicle also deserves attention: the Unmanned Surface Vehicle (USV). USVs are not yet widely applied in civil use, and studies of USVs are relatively few and began comparatively late.

algorithm, constraint, path planning, (14 more...)

arXiv.org Artificial Intelligence

2007.01691

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Asia > China > Hubei Province > Wuhan (0.05)
(21 more...)

Genre: Research Report (0.82)

Industry:

Transportation (1.00)
Government > Military (1.00)
Information Technology (0.87)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Self-Supervised Learning-Based Path Planning and Obstacle Avoidance Using PPO and B-Splines in Unknown Environments

Shokouhi, Shahab, Oruc, Oguzhan, Thein, May-Win

arXiv.org Artificial Intelligence, Dec-3-2024

This paper introduces SmartBSP, an advanced self-supervised learning framework for real-time path planning and obstacle avoidance in autonomous robotics navigating through complex environments. The proposed system integrates Proximal Policy Optimization (PPO) with Convolutional Neural Networks (CNN) and Actor-Critic architecture to process limited LIDAR inputs and compute spatial decision-making probabilities. The robot's perceptual field is discretized into a grid format, which the CNN analyzes to produce a spatial probability distribution. During the training process, a nuanced cost function is minimized that accounts for path curvature, endpoint proximity, and obstacle avoidance. Simulation results validate the algorithm's resilience and adaptability across diverse operational scenarios. Subsequently, real-time experiments employing the Robot Operating System (ROS) were carried out to assess the efficacy of the proposed algorithm.

algorithm, obstacle, path planning, (15 more...)

arXiv.org Artificial Intelligence

2412.02176

Country:

North America > United States > New Hampshire (0.05)
North America > United States > South Carolina > Charleston County (0.04)
North America > United States > Mississippi > Harrison County > Biloxi (0.04)
(3 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Encouraging Responsible Use of Generative AI in Education: A Reward-Based Learning Approach

Singh, Aditi, Ehtesham, Abul, Kumar, Saket, Gupta, Gaurav Kumar, Khoei, Tala Talaei

arXiv.org Artificial Intelligence, Jun-26-2024

This research introduces an innovative mathematical learning approach that integrates generative AI to cultivate structured learning rather than quick solutions. Our method combines chatbot capabilities and generative AI to offer interactive problem-solving exercises, enhancing learning through a step-by-step approach for varied problems and advocating for the responsible use of AI in education. Our approach emphasizes that immediate answers from ChatGPT can impede real learning. We introduce a reward-based system that requires students to solve mathematical problems effectively to receive the final answer. This encourages a progressive learning path from basic to complex problems, rewarding mastery with final solutions. The goal is to transition students from seeking quick fixes to engaging actively in a comprehensive learning experience.

chatgpt, mega, student, (16 more...)

arXiv.org Artificial Intelligence

2407.15022

Country:

Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > Virginia > Loudoun County > Sterling (0.04)
North America > United States > Mississippi > Harrison County > Biloxi (0.04)
(4 more...)

Genre:

Instructional Material (0.69)
Research Report (0.50)

Industry:

Education > Curriculum > Subject-Specific Education (0.69)
Education > Educational Setting > K-12 Education (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.93)

3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification

Varahagiri, Shyam, Sinha, Aryaman, Dubey, Shiv Ram, Singh, Satish Kumar

arXiv.org Artificial Intelligence, Apr-19-2024

In recent years, Vision Transformers (ViTs) have shown promising classification performance over Convolutional Neural Networks (CNNs) due to their self-attention mechanism. Many researchers have incorporated ViTs for Hyperspectral Image (HSI) classification. HSIs are characterised by narrow contiguous spectral bands, providing rich spectral data. Although ViTs excel with sequential data, they cannot extract spectral-spatial information like CNNs. Furthermore, to have high classification performance, there should be a strong interaction between the HSI token and the class (CLS) token. To solve these issues, we propose a 3D-Convolution guided Spectral-Spatial Transformer (3D-ConvSST) for HSI classification that utilizes a 3D-Convolution Guided Residual Module (CGRM) in-between encoders to "fuse" the local spatial and spectral information and to enhance the feature propagation. Furthermore, we forego the class token and instead apply Global Average Pooling, which effectively encodes more discriminative and pertinent high-level features for classification. Extensive experiments have been conducted on three public HSI datasets to show the superiority of the proposed model over state-of-the-art traditional, convolutional, and Transformer models. The code is available at https://github.com/ShyamVarahagiri/3D-ConvSST.
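The abstract above contrasts two ways of reading a classification feature out of a Transformer encoder: using a dedicated class (CLS) token, or applying Global Average Pooling over the output tokens. The following is a minimal sketch of that difference only, written in plain Python; the function names `cls_readout` and `global_average_pool` and the toy embeddings are illustrative assumptions and are not taken from the 3D-ConvSST code.

```python
from typing import List

Vector = List[float]


def cls_readout(tokens_with_cls: List[Vector]) -> Vector:
    """Hypothetical CLS-style readout: use only the first token's embedding."""
    if not tokens_with_cls:
        raise ValueError("need at least one token")
    return list(tokens_with_cls[0])


def global_average_pool(tokens: List[Vector]) -> Vector:
    """Hypothetical GAP readout: average every token embedding per dimension."""
    if not tokens:
        raise ValueError("need at least one token")
    dim = len(tokens[0])
    return [sum(tok[d] for tok in tokens) / len(tokens) for d in range(dim)]


# Toy encoder output: three tokens with two-dimensional embeddings.
tokens = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
pooled = global_average_pool(tokens)
print(pooled)  # [3.0, 4.0]
```

With pooling, every token contributes equally to the feature passed to the classifier head, whereas a CLS readout depends on a single position. Whether this matches the exact pooling used by the authors is an assumption.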

classification, remote sensing, transformer, (14 more...)

arXiv.org Artificial Intelligence

2404.13252

Country:

Africa > Botswana (0.07)
North America > United States > Mississippi > Harrison County > Gulfport (0.04)
Asia > India (0.04)
Asia > China > Hubei Province > Wuhan (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

COLA: Characterizing and Optimizing the Tail Latency for Safe Level-4 Autonomous Vehicle Systems

Liu, Haolan, Wang, Zixuan, Zhao, Jishen

arXiv.org Artificial Intelligence, May-11-2023

Autonomous vehicles (AVs) are envisioned to revolutionize our life by providing safe, relaxing, and convenient ground transportation. The computing systems in such vehicles are required to interpret various sensor data and generate responses to the environment in a timely manner to ensure driving safety. However, such timing-related safety requirements are largely unexplored in prior works. In this paper, we conduct a systematic study to understand the timing requirements of AV systems. We focus on investigating and mitigating the sources of tail latency in Level-4 AV computing systems. We observe that the performance of AV algorithms is not uniformly distributed -- instead, the latency is susceptible to vehicle environment fluctuations, such as traffic density. This contributes to burst computation and memory access in response to the traffic, and further leads to tail latency in the system. Furthermore, we observe that tail latency also comes from a mismatch between the pre-configured AV computation pipeline and the dynamic latency requirements in real-world driving scenarios. Based on these observations, we propose a set of system designs to mitigate AV tail latency. We demonstrate our design on widely-used industrial Level-4 AV systems, Baidu Apollo and Autoware. The evaluation shows that our design achieves 1.65 X improvement over the worst-case latency and 1.3 X over the average latency, and avoids 93% of accidents on Apollo.
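To make the distinction between average and tail latency in the abstract above concrete, here is a small sketch that computes a mean and a nearest-rank percentile over latency samples. The sample values, the `percentile` helper, and the choice of the nearest-rank definition are assumptions made for illustration; they do not come from the COLA paper or from Apollo or Autoware.

```python
import math
from typing import Sequence


def percentile(samples: Sequence[float], p: float) -> float:
    """Nearest-rank percentile for 0 < p <= 100 (illustrative helper)."""
    if not samples:
        raise ValueError("need at least one sample")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


# Made-up per-frame latencies in milliseconds: mostly fast, with two bursts.
latencies_ms = [20.0] * 98 + [150.0, 400.0]

mean_ms = sum(latencies_ms) / len(latencies_ms)
p99_ms = percentile(latencies_ms, 99)
worst_ms = percentile(latencies_ms, 100)

print(f"mean={mean_ms:.1f} ms, p99={p99_ms:.1f} ms, worst={worst_ms:.1f} ms")
# mean=25.1 ms, p99=150.0 ms, worst=400.0 ms
```

In this toy data the mean stays close to the typical frame time while the 99th percentile and worst case are several times larger, which is why a design judged only on average latency can still miss timing deadlines.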

artificial intelligence, latency, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2305.07147

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > China (0.04)
North America > United States > Mississippi > Harrison County > Biloxi (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry:

Transportation > Ground > Road (1.00)
Information Technology (1.00)

Technology:

Information Technology > Software (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Augmented Language Models: a Survey

Mialon, Grégoire, Dessì, Roberto, Lomeli, Maria, Nalmpantis, Christoforos, Pasunuru, Ram, Raileanu, Roberta, Rozière, Baptiste, Schick, Timo, Dwivedi-Yu, Jane, Celikyilmaz, Asli, Grave, Edouard, LeCun, Yann, Scialom, Thomas

arXiv.org Artificial Intelligence, Feb-15-2023

This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools. The former is defined as decomposing a potentially complex task into simpler subtasks, while the latter consists of calling external modules such as a code interpreter. LMs can leverage these augmentations separately or in combination via heuristics, or learn to do so from demonstrations. While adhering to a standard missing-token prediction objective, such augmented LMs can use various, possibly non-parametric external modules to expand their context processing ability, thus departing from the pure language modeling paradigm. We therefore refer to them as Augmented Language Models (ALMs). The missing-token objective allows ALMs to learn to reason, use tools, and even act, while still performing standard natural language tasks and even outperforming most regular LMs on several benchmarks. In this work, after reviewing current advances in ALMs, we conclude that this new research direction has the potential to address common limitations of traditional LMs such as interpretability, consistency, and scalability issues.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2302.07842

Country:

Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.04)
North America > United States > Mississippi > Harrison County > Gulfport (0.04)
(6 more...)

Genre: Overview (1.00)

Industry:

Education (1.00)
Leisure & Entertainment > Games (0.67)
Information Technology > Services (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(4 more...)

Momentum Decoding: Open-ended Text Generation As Graph Exploration

Lan, Tian, Su, Yixuan, Liu, Shuhang, Huang, Heyan, Mao, Xian-Ling

arXiv.org Artificial Intelligence, Dec-5-2022

Open-ended text generation with autoregressive language models (LMs) is one of the core tasks in natural language processing. However, maximization-based decoding methods (e.g., greedy/beam search) often lead to the degeneration problem, i.e., the generated text is unnatural and contains undesirable repetitions. Existing solutions to this problem either introduce randomness prone to incoherence or require a look-ahead mechanism that demands extra computational overhead. In this study, we formulate open-ended text generation from a new perspective, i.e., we view it as an exploration process within a directed graph. Thereby, we understand the phenomenon of degeneration as circular loops within the directed graph. Based on our formulation, we propose a novel decoding method, momentum decoding, which encourages the LM to greedily explore new nodes outside the current graph. Meanwhile, it also allows the LM to return to the existing nodes with a momentum downgraded by a pre-defined resistance function. We extensively test our approach on three benchmarks from different domains through automatic and human evaluations. The results show that momentum decoding performs comparably with the current state of the art while enjoying notably improved inference speed and computation FLOPs. Furthermore, we conduct a detailed analysis to reveal the merits and inner workings of our approach. Our code and other related resources are publicly available at https://github.com/gmftbyGMFTBY/MomentumDecoding.

artificial intelligence, momentum, natural language, (19 more...)

arXiv.org Artificial Intelligence

2212.02175

Country:

Europe > Slovakia (0.05)
Europe > Italy (0.04)
Europe > Russia (0.04)
(19 more...)

Genre: Research Report > New Finding (0.86)

Industry:

Law (0.68)
Leisure & Entertainment > Sports > Soccer (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.34)

Shared Manifold Learning Using a Triplet Network for Multiple Sensor Translation and Fusion with Missing Data

Dutt, Aditya, Zare, Alina, Gader, Paul

arXiv.org Artificial Intelligence, Oct-25-2022

Heterogeneous data fusion can enhance the robustness and accuracy of an algorithm on a given task. However, due to the difference in various modalities, aligning the sensors and embedding their information into discriminative and compact representations is challenging. In this paper, we propose a Contrastive learning based MultiModal Alignment Network (CoMMANet) to align data from different sensors into a shared and discriminative manifold where class information is preserved. The proposed architecture uses a multimodal triplet autoencoder to cluster the latent space in such a way that samples of the same classes from each heterogeneous modality are mapped close to each other. Since all the modalities exist in a shared manifold, a unified classification framework is proposed. A comparison made with other methods demonstrates the superiority of this method.

[...] outstanding results on tasks like land-use and land-cover classification (LULC) [1] [2], mineral exploration [3] [4] [5], urban planning [6], biodiversity conservation [7], sentiment [...]

[...] This method is also called decision fusion. In the context of a neural network, these representations are generated by the convolutional layers and fused gradually to form a shared representation layer. [...] Fusion methods can be classified into two groups: concatenation and alignment-based methods.

[...] learn spatial information by using a structured morphological element of predefined size and shape. They proposed a graph-based model to couple the dimension reduction and fusion of information. However, using this method, the cloud-covered regions are not accurately classified because the morphological [...]

To increase the interpretability of fusion models, Hong et al. [27] proposed a shared and specific feature learning (S2FL) model that is capable of decomposing data into modality-shared and modality-specific components, which enables a better information blending of multiple heterogeneous modalities.
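The entry above describes a multimodal triplet autoencoder that maps same-class samples from different sensors close together in a shared latent space. As a rough illustration of the triplet idea only, the sketch below computes a standard triplet margin loss on toy two-dimensional embeddings. The function names, the Euclidean distance, and the margin value are assumptions; the listing does not state the exact loss used by CoMMANet.

```python
import math
from typing import Sequence


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two embeddings of equal length."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_margin_loss(anchor, positive, negative, margin: float = 1.0) -> float:
    """Standard triplet margin loss: max(0, d(a, p) - d(a, n) + margin)."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


# Hypothetical latent embeddings: the anchor and positive share a class but
# come from different sensors; the negative belongs to another class.
anchor = [0.0, 0.0]      # e.g. a sample from sensor A
positive = [0.0, 1.0]    # same class, sensor B
far_negative = [3.0, 4.0]
near_negative = [0.0, 1.5]

print(triplet_margin_loss(anchor, positive, far_negative))   # 0.0
print(triplet_margin_loss(anchor, positive, near_negative))  # 0.5
```

The loss is zero once the negative is farther from the anchor than the positive by at least the margin, and positive otherwise, which is what pulls same-class samples from different modalities together during training.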

artificial intelligence, machine learning, sensor, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/JSTARS.2022.3217485

2210.17311

Country:

North America > United States > Florida > Alachua County > Gainesville (0.14)
North America > United States > Mississippi > Harrison County > Gulfport (0.04)
North America > United States > Alaska > Denali Borough > Healy (0.04)
Asia > Middle East > Syria > Daraa Governorate > Dar'a (0.04)

Genre: Research Report (1.00)

Industry:

Education (0.82)
Government > Military (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

PEER: A Collaborative Language Model

Schick, Timo, Dwivedi-Yu, Jane, Jiang, Zhengbao, Petroni, Fabio, Lewis, Patrick, Izacard, Gautier, You, Qingfei, Nalmpantis, Christoforos, Grave, Edouard, Riedel, Sebastian

arXiv.org Artificial Intelligence, Aug-24-2022

Textual content is often the output of a collaborative writing process: We start with an initial draft, ask for suggestions, and repeatedly make changes. Agnostic of this process, today's language models are trained to generate only the final result. As a consequence, they lack several abilities crucial for collaborative writing: They are unable to update existing texts, difficult to control and incapable of verbally planning or explaining their actions. To address these shortcomings, we introduce PEER, a collaborative language model that is trained to imitate the entire writing process itself: PEER can write drafts, add suggestions, propose edits and provide explanations for its actions. Crucially, we train multiple instances of PEER able to infill various parts of the writing process, enabling the use of self-training techniques for increasing the quality, amount and diversity of training data. This unlocks PEER's full potential by making it applicable in domains for which no edit histories are available and improving its ability to follow instructions, to write useful comments, and to explain its actions. We show that PEER achieves strong performance across various domains and editing tasks.

computational linguistic, language model, wikipedia, (15 more...)

arXiv.org Artificial Intelligence

2208.11663

Country:

North America > United States > California > Los Angeles County > Inglewood (0.28)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
North America > United States > Mississippi > Harrison County > Gulfport (0.04)
(16 more...)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Robust Semi-Supervised Classification using GANs with Self-Organizing Maps

Fick, Ronald, Gader, Paul, Zare, Alina

arXiv.org Artificial Intelligence, Oct-19-2021

Generative adversarial networks (GANs) have shown tremendous promise in learning to generate data and are effective at aiding semi-supervised classification. However, to this point, semi-supervised GAN (SS-GAN) methods make the assumption that the unlabeled data set contains only samples of the joint distribution of the classes of interest, referred to as inliers. Consequently, when presented with a sample from other distributions, referred to as outliers, GANs perform poorly at determining that they are not qualified to make a decision on the sample. The problem of discriminating outliers from inliers while maintaining classification accuracy is referred to here as the DOIC problem. In this work, we describe an architecture that combines self-organizing maps (SOMs) with SS-GANs with the goal of mitigating the DOIC problem, and present experimental results indicating that the architecture achieves the goal. Multiple experiments were conducted on hyperspectral image data sets. The SS-GANs performed slightly better than supervised GANs on classification problems with and without the SOM. Incorporating the SOMs into the SS-GANs and the supervised GANs led to substantial mitigation of the DOIC problem when compared to SS-GANs and GANs without the SOMs. Furthermore, the SS-GANs performed much better than GANs on the DOIC problem, even without the SOMs.

artificial intelligence, machine learning, node, (15 more...)

arXiv.org Artificial Intelligence

2110.10286

Country:

North America > United States > Florida > Alachua County > Gainesville (0.14)
North America > United States > Mississippi > Harrison County > Gulfport (0.04)
North America > United States > California (0.04)
(2 more...)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.36)
